Fast Inverse Reinforcement Learning with Interval Consistent Graph for Driving Behavior Prediction
نویسندگان
چکیده
Maximum entropy inverse reinforcement learning (MaxEnt IRL) is an effective approach for learning the underlying rewards of demonstrated human behavior, while it is intractable in high-dimensional state space due to the exponential growth of calculation cost. In recent years, a few works on approximating MaxEnt IRL in large state spaces by graphs provide successful results, however, types of state space models are quite limited. In this work, we extend them to more generic large state space models with graphs where time interval consistency of Markov decision processes are guaranteed. We validate our proposed method in the context of driving behavior prediction. Experimental results using actual driving data confirm the superiority of our algorithm in both prediction performance and computational cost over other existing
منابع مشابه
Inverse Reinforcement Learning through Structured Classification
This paper adresses the inverse reinforcement learning (IRL) problem, that is inferring a reward for which a demonstrated expert behavior is optimal. We introduce a new algorithm, SCIRL, whose principle is to use the so-called feature expectation of the expert as the parameterization of the score function of a multiclass classifier. This approach produces a reward function for which the expert ...
متن کاملGraph Based Inverse Reinforcement Learning for Social Robot Navigation
Abstract. Mobile robots that operate in human populated spaces are required to act in ways perceived as socially normative with respect to the environment considered. Past approaches have merely focussed on robot centric optimality criteria such as path lengths, heading changes, time to goal etc while ignoring social aspects like personal space intrusions. We learn how to navigate socially from...
متن کاملInverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling
Recent advances in the field of inverse reinforcement learning (IRL) have yielded sophisticated frameworks which relax the original modeling assumption that the behavior of an observed agent reflects only a single intention. Instead, the demonstration data is typically divided into parts, to account for the fact that different trajectories may correspond to different intentions, e.g., because t...
متن کاملSafe, Multi-Agent, Reinforcement Learning for Autonomous Driving
Autonomous driving is a multi-agent setting where the host vehicle must apply sophisticated negotiation skills with other road users when overtaking, giving way, merging, taking left and right turns and while pushing ahead in unstructured urban roadways. Since there are many possible scenarios, manually tackling all possible cases will likely yield a too simplistic policy. Moreover, one must ba...
متن کاملBackward inhibitory learning in honeybees: a behavioral analysis of reinforcement processing.
One class of theoretical accounts of associative learning suggests that reinforcers are processed according to learning rules that minimize the predictive error between the expected strength of future reinforcement and its actual strength. The omission of reinforcement in a situation where it is expected leads to inhibitory learning of stimuli indicative for such a violation of the prediction. ...
متن کامل